SEQUENCE LISTING



<110> Merck Patent GmbH



<120> Glucose dehydrogenase fusion proteins and their 
use in expression systems 

<130> 9906920-3z-mi 

<140> 
<141> 

<160> 16 

<170> PatentIn Ver. 2.1

<210> 1

<211> 3992 

<212> DNA 

<213> Bacillus megaterium

<220> 

<221> CDS 

<222> (186) . . (968)

<223> Glucose dehydrogenase from Bacillus megaterium

<220> 
<221> CDS 

<222> (978) . . (1010) 

<223> Poly-histidine tag

<220>

<221> gene 

<222> (1) . . (3992)

<223> Plasmid pAW2

<400> 1

ccatcgaatg gccagatgat taat.ectaa tttttgttga taettra^ca ztgatagagt 60

tattttacta ctctctatca gtgatagaga aaagtgaaat gaatagttcg .acaaaaaz:: 120 

aga-aacgag ggcaarcgat gaattcgace zcqqzaczzq gqczzccczc gaggtcgace 180

tgcag atg tat aca gat tta aaa gat aaa gta gtt gta att aca ggt gga 230
Met Tyr Thr Asp Leu Lys Asp Lys Val Val Val Ile Thr Gly Gly
1 5 10 15

tea aca ggt eta gga cge gca atg get gtt cgt ttc ggt caa gaa qaa 278
Ser Thr Gly Leu Gly Arg Ala Met Ala Val Arg Phe Gly Gln Glu Glu
20 25 30 

aca aaa gtt gtt att aa: cat tac aac aat gaa gaa gaa get eta gat 326 
Ala Lys Val Val Ile Asn Tyr Tyr Asn Asn Glu Glu Glu Ala Leu Asp
35 40 45



gcg aaa aaa gaa gta gaa gaa gca ggc gga caa gca ace ate gtt caa 374 
Ala Lys Lys Glu Val Glu Glu Ala Gly Gly Gln Ala Ile Ile Val Gln
50 55 60 
ggc gat gta aca aaa gaa gaa gac gtt gta aa: c:t gtt caa aca gc
Gly Asp Val Thr Lys Glu Glu Asp Val Val Asn Leu Val Gln Thr Ala
65 70 75



att aaa gaa ttt ggt aca tta gac gta atg att aac aac get ggt gtt 470
Ile Lys Glu Phe Gly Thr Leu Asp Val Met Ile Asn Asn Ala Gly Val
80 85 90 95

gaa aac cca gtt cct tct car gag eta tct eta gat aac tgg aac aaa 518
Glu Asn Pro Val Pro Ser His Glu Leu Ser Leu Asp Asn Trp Asn Lys
100 105 110



gtt att gat aca aac tta aca ggt gca ttc tta gga age cgt gaa gca 566
Val Ile Asp Thr Asn Leu Thr Gly Ala Phe Leu Gly Ser Arg Glu Ala
115 120 125

att aaa tac ttc gtt gaa aac gac att aaa gga aat gtt ate aac atg 614
Ile Lys Tyr Phe Val Glu Asn Asp Ile Lys Gly Asn Val Ile Asn Met
130 135 140

tct age gtt cae gaa atg att cct tgg cca tta ttt gtt cac tac gca 662
Ser Ser Val His Glu Met Ile Pro Trp Pro Leu Phe Val His Tyr Ala
145 150 155

gca agt aaa gcc ggt atg aaa eta atg acg gaa aca ttg get ctt gaa 710
Ala Ser Lys Gly Gly Met Lys Leu Met Thr Glu Thr Leu Ala Leu Glu
160 165 170 175



tat gcg cca aaa ggt ate cgc gta aat aat att gga cca ggt gcg atg 758 

Tyr Ala Pro Lys Gly Ile Arg Val Asn Asn Ile Gly Pro Gly Ala Met

180 185 190 

aac aca cca att aac gca gag aaa ttt gca gat cca gaa caa cgt gca 806

Asn Thr Pro Ile Asn Ala Glu Lys Phe Ala Asp Pro Glu Gln Arg Ala

195 200 205

gac gta gaa age arc att tea atg ggt tac ate ggt aaa cca gaa gaa 854 

Asp Val Glu Ser Met Ile Pro Met Gly Tyr Ile Gly Lys Pro Glu Glu

210 215 220

gta gca gca gtt gca gca ::: tta gc-. cca tea caa gca age tat gta 902

Val Ala Ala Val Ala Ala Phe Leu Ala Ser Ser Gln Ala Ser Tyr Val
225 230 235



aca ggt att aca :ta ttt gca gat cgc ggt atg acg aaa tac cct tct 950
Thr Gly Ile Thr Leu Phe Ala Asp Gly Gly Met Thr Lys Tyr Pro Ser
240 245 250 255

jca gga aga ggc taatagagc get atg aga gga cat cac cat 1001
Phe Gln Ala Gly Arg Gly Ala Met Arg Gly Ser His His His
260 265

cac cat cac taatagaagc ttgacctgtg aagtgaaaaa tgg cgc aca t 1050
His His His
270

tgtgcgacat tttttttgtt tgccgtttac egctac-gcg tcaeggatct ccaegcgccc 1110 
tgtagcggcg cattaagege ggcgggtgtg gtggttacgc geagegtgae cgctacactt 1170 




gccagcgccc tagcgcccgc tcctctcgc-. ttc-.ccs«: ccrtccccgc cacgctcgcc 1230
ggcrctcccc gtcaagccc-- aaa-cggggg «cc=".tag ggtc=cgatt: tag-gc"-a 1290 
cggcacctcg accccaaaaa acctgactag ggngaegg« cacgcagtgg gccatcgccc 1350 
tgatagacgg ttrccsgeac tczgaegntg gagtc=acgt -.r.tcaaug tggactcttg 1410 
tcseaaictg gaacaacact caa=ccta=c tcggtcratt ct.rcgattt anaagggatr 1470 
ttgccgaccr cggc=ca"g g—aaaaaat gagctgar.^ aacaaaaatt taacgcgaat 1530 
tttaacaaaa tatcaaegct cacaatrzca ggcggcacc- treggggaaa tgtgcgcgga 1590 
acc=ctat-t gtrrac-ttt .craaazacac tcaaacatg- zzccgctcaz gagacaataa 1650 
ccccgacaaa .tgcKcaata acattgaaaa aggaagag-a tgag-.at-ca acasczccgr 1710 
g-.cgccctta «ccc««t zgcgg=a«-- --gc=«c«g =rt---.gcr=a cccagaaacg 1770 
ccgg-gaaag caaaagatgc rgaagar=ag "ggg-gcac gagrgggzza catcgaactg 1830
garcccaaca gcggtaaga. ccc.gagacc t"cgc===g aagaacg-r cccaatga^g 1890 
agcaccttta aagrrc-gc- atgtggcgcg g««ac==c gzatrgacgc cgggcaagag 1950
caactcggtc gccgcataca ctat, : tcac aacgactcgg. trgagtacrc accagccaca 2010 
gaaaagcatc ttacggatgg cacgacagw agagaattat gcagtgc-gc cacaaccatg 2070 
agrgaeaaca c"-gcggccaa =ua:::::g acaacga-.cg gaggaccgaa ggagctaacc 2130 
g«««rg= acaacatggg ggaccatgca 3 c:=ccc::: atcg-.tggga ac=ggagc--g 2190 
aa-.gaagcca caccaaacga =gag=g-_gac accacgatgc «gragcaat ggcaacaacg 2250 
ttgcgcaaac rattaactgg cgaaccacrt a:.-.c^,. cggcaa.a ^.^ * 

~~ a - = -*rr- --,-^- a - — c-— -qcqc- cagcccttcc ggctggcrgg 2370 
rgcatggagg cggaca^ag _ ^g-ag^a^ < ^ - ^ 

ct=acng « g acaaatc-.gg agccgg.gag =g-.gcc,ccc gcggtatcat: tgcagca«g 2430 
gggccagatg g-.aagccc" ccg-.az=g« g>--.a-.c-.aca cgacggggag tcaggcaact 2490
atggacgaac gaaacagaca gat=,c-.gac a«gc-.gc=-. =a«ga«aa gcattgg^ag 2550
g.actaacga tgrctcg---.-. tg.caaaag, aaag-.gat-.a ac.gcgcatr aglgczgctc 2610 
a.cg.ggtcg gaaccgaagg r.-.aacaacc =g«aa«cg c^agaagc, agg-.g«gag 2670
cagcctacat tgtaccggca tgtaaaaaac aagcgggc" cg«cgacgc c«agccat= 2730 
gaga t g«ag ataggcacca ,ac,=acc,-. cgccc-.^.ag aaggggaaag ctggcaagat 2790
tttrcacgta ataacgctaa aag"=»ga =grg=cc = ac -.aagtcatcg cgacggagca 2850
aaagtaca- caggtacacg gcczacagaa 0 -ca,.av.g aaac -^ cgd 

_ , "a-3^crca: -atatgcact cagcgcagtg 2970 

gccttt::at gccaacaagg ^ a ^^^"^ ^ 
gggcatttta ctttaggttg cgtattggaa ga-caagage aCcaagtcgc taaagaacaa 3030

agggaaacac ctactactga tagtatgccg ccattattac gacaagctat cgaattattt 3090

gatcaccaag gtgcagagcc agccttctta ttcggccttg aattgatcat atgcggatta 3150

gaaaaacaac ttaaatgtga aagtgggt.cc taaaagcagc ataaccttc: tccgtgatgg 3210 

taacttcact agtttaaaag gatctaggtg aagatcctct ttgataatct catgaccaaa 3270 

atccct'aac gtgagtrttc gttccactga gcgtcagacc ccgtagaaaa gatcaaagga 3330 

tcttcttgag accctttttr tatgcgcgta atctgctgct tgcaaacaaa aaaaccaccg 3390

ctaccagcgg tggtttgttt gccggatcaa gagc:accaa ctctttttcc gaaggtaact 3450 

ggcttcagca gagegcagat accaaanact gtccttctag tgtagccgta gttaggecac 3510 

cacttcaaga actctgragc accgcctaca tacctcgctc tgetaatect gttaccagtg 3570 

gctgctgcca gtggcgataa gtcgtgtctt accgggttgg actcaagacg atagttaccg 3630

gataaggege ageggtegge ctgaacgggg ggttcgtgca cacagcccag cttggaccca 3690

acgacctaca ccgaactgag atacctacag cgtgagctat gagaaagege cacgcttccc 3750 

gaagggagaa aggeggacag gtatccggta ageggcagge teggaacagg agagegcacg 3810 

agggagcttc cagggggaaa cgcctggtat ctttatagtc ctgtcgggtt tcgccacctc 3870 

tgacttgagc gtcgattttt gtgatgeteg tcaggggggc ggagcctatg gaaaaacgee 3930 

ageaacgegg cctttttacg gttcctggcc ttttgctggc ettttgetea catgacccga 3990 

ca 3992 



<210> 2 
<211> 272
<212> PRT

<213> Bacillus megaterium (GlcDH-polytag fusion protein) 
<400> 2 

Met Tyr Thr Asp Leu Lys Asp Lys Val Val Val Ile Thr Gly Gly Ser
1 5 10 15

Thr Gly Leu Gly Arg Ala Met Ala Val Arg Phe Gly Gln Glu Glu Ala
20 25 30

Lys Val Val Ile Asn Tyr Tyr Asn Asn Glu Glu Glu Ala Leu Asp Ala
35 40 45

Lys Lys Glu Val Glu Glu Ala Gly Gly Gln Ala Ile Ile Val Gln Gly
50 55 60

Asp Val Thr Lys Glu Glu Asp Val Val Asn Leu Val Gln Thr Ala Ile
65 70 75 80

Lys Glu Phe Gly Thr Leu Asp Val Met Ile Asn Asn Ala Gly Val Glu
85 90 95




Asn Pro Val Pro Ser His Glu Leu Ser Leu Asp Asn Trp Asn Lys Val
100 105 110

Ile Asp Thr Asn Leu Thr Gly Ala Phe Leu Gly Ser Arg Glu Ala Ile
115 120 125

Lys Tyr Phe Val Glu Asn Asp Ile Lys Gly Asn Val Ile Asn Met Ser
130 135 140

Ser Val His Glu Met Ile Pro Trp Pro Leu Phe Val His Tyr Ala Ala
145 150 155 160

Ser Lys Gly Gly Met Lys Leu Met Thr Glu Thr Leu Ala Leu Glu Tyr
165 170 175

Ala Pro Lys Gly Ile Arg Val Asn Asn Ile Gly Pro Gly Ala Met Asn
180 185 190



Thr Pro Ile Asn Ala Glu Lys Phe Ala Asp Pro Glu Gln Arg Ala Asp
195 200 205

Val Glu Ser Met Ile Pro Met Gly Tyr Ile Gly Lys Pro Glu Glu Val
210 215 220

Ala Ala Val Ala Ala Phe Leu Ala Ser Ser Gln Ala Ser Tyr Val Thr
225 230 235 240

Gly Ile Thr Leu Phe Ala Asp Gly Gly Met Thr Lys Tyr Pro Ser Phe
245 250 255

Gln Ala Gly Arg Gly Ala Met Arg Gly Ser His His His His His His
260 265 270



<210> 3 
<211> 4193 

<212> DNA

<213> Bacillus megaterium + Haementeria ghilianii fusion gene



<220> 

<221> gene

<222> (1) . . (4193)

<223> Plasmid pAW4

<220> 

<221> CDS 

<222> (141) . . (344)

<223> Tridegin 



<220> 
<221> CDS

<222> (387) . . (1169)

<223> Glucose dehydrogenase

<220> 
<221> CDS 

<222> (1179) . . (1211)

<223> Poly-histidine tag 
<400> 3

ccatcgaatg gccagatgat taattcc-aa tttttgttga cactccatca ttgatagact 60 

tattttacca ctccctatca gtgatagaga aaagtgaaac gaatagttcg acaaaaa:cc 120 

agataacgag ggcaatcgat atg aaa eta ttg cct zgc aaa gaa egg cat caa 173 

Met Lys Leu Leu Pro Cys Lys Glu Trp His Gln
1 5 10



ggt act cct aac ccz agg ngc tgg tgt ggg get gar era gaa zgc gca 221
Gly Ile Pro Asn Pro Arg Cys Trp Cys Gly Ala Asp Leu Glu Cys Ala
15 20 25



caa gac caa tac tgt gec ttc aca cct caa ret aga cca aga tea gaa 269 
Gln Asp Gln Tyr Cys Ala Phe Ile Pro Gln Cys Arg Pro Arg Ser Glu
30 35 40 

ctg att aaa ccz atg gaz gat ana tac caa aga cea gtc gag "t cca 317 
Leu Ile Lys Pro Met Asp Asp Ile Tyr Gln Arg Pro Val Glu Phe Pro
45 50 55

aac czz cca tta aaa cct agg gag gaa agegctatga gaggatcgea 364
Asn Leu Pro Leu Lys Pro Arg Glu Glu 
60 65 

tcaccatcac catcacccgc ag atg tat aca gac eta aaa gat aaa gta gtt 416 

Met Tyr Thr Asp Leu Lys Asp Lys Val Val
70 75 

gta att aca ggt gga tea aca ggt tta gga cgc gca atg get att cgt 464
Val Ile Thr Gly Gly Ser Thr Gly Leu Gly Arg Ala Met Ala Val Arg
80 85 90

ttc ggt caa gaa gaa gca aaa gtt gtt att aac tat tac aac aat gaa 512 
Phe Gly Gln Glu Glu Ala Lys Val Val Ile Asn Tyr Tyr Asn Asn Glu
95 100 105 110

gaa gaa get eta gat gcg aaa aaa gaa gta gaa gaa gca ggc gga caa 560 
Glu Glu Ala Leu Asp Ala Lys Lys Glu Val Glu Glu Ala Gly Gly Gln
115 120 125

gca ate ate gtt caa ggc gat gta aca aaa gaa gaa gac gtt gta aa: 608
Ala Ile Ile Val Gln Gly Asp Val Thr Lys Glu Glu Asp Val Val Asn
130 135 140

ctt gtt caa aca get att aaa gaa ttt ggt aca tta gac gta atg att 656
Leu Val Gln Thr Ala Ile Lys Glu Phe Gly Thr Leu Asp Val Met Ile
145 150 155

aac aac get ggt gtt gaa aac cca gtt cct tct cat gag eta zzz eta 704
Asn Asn Ala Gly Val Glu Asn Pro Val Pro Ser His Glu Leu Ser Leu
160 165 170

gat aac tgg aac aaa gzz a:: gac aca aac zza aca ggt gca ttc tta 752 
Asp Asn Trp Asn Lys Val Ile Asp Thr Asn Leu Thr Gly Ala Phe Leu
175 180 185 190 

gga age cgt gaa gca att aaa tac ttc gtt gaa aac gac att aaa gga 800 
Gly Ser Arg Glu Ala Ile Lys Tyr Phe Val Glu Asn Asp Ile Lys Gly
195 200 205 
aat gtt arc aac atg cct age get cac gaa acg at" cct egg cca -ta 848
Asn Val Ile Asn Met Ser Ser Val His Glu Met Ile Pro Trp Pro Leu
210 215 220

tt t get cac tac gca gca ag-c aaa ggc ggt acg aaa eta atg acg gaa 896 
Phe Val His Tyr Ala Ala Ser Lys Gly Gly Met Lys Leu Met Thr Glu
225 230 235

aca ctg get ctt gaa tat gcg cca aaa ggt att cgc gca aat aat att 944 
Thr Leu Ala Leu Glu Tyr Ala Pro Lys Gly Ile Arg Val Asn Asn Ile
240 245 250 

gga cca ggt gcg atg aac aca cca att aac cca gag aaa tec gca gat 992 
Gly Pro Gly Ala Met Asn Thr Pro Ile Asn Ala Glu Lys Phe Ala Asp
255 260 265 270

cca gaa caa cgr gca gac gta gaa age atg act cca acg ggt tac ate 1040 
Pro Glu Gln Arg Ala Asp Val Glu Ser Met Ile Pro Met Gly Tyr Ile
275 280 285

ggt aaa cca gaa gaa gca gca cca cc t gca gca etc tea get tea tea 1088
Gly Lys Pro Glu Glu Val Ala Ala Val Ala Ala Phe Leu Ala Ser Ser
290 295 300 

caa gca age cat gta aca ggt act aca tea etc gca gat ggc ggt atg 1136 
Gln Ala Ser Tyr Val Thr Gly Ile Thr Leu Phe Ala Asp Gly Gly Met
305 310 315

acg aaa cac ccc tec etc caa gca gga aga ggc caatagagc get atg aga 1187
Thr Lys Tyr Pro Ser Phe Gln Ala Gly Arg Gly Ala Met Arg
320 325 330

gga teg cat cac cat cac cat cac eaatagaagc ttgacccgcg aaccgaaaaa 1241 
Gly Ser His His His His His His 
335 340 

eggegcaeat cgcgcgacac cccceccgec cgccrcccac cgceaccgcg ccacggacet 1301

ccacgcgccc tgtagcggcg caccaacccc ggcgggcgtg gtggttacgc geagegtgae 1361

cgctacactc gccagcgccc cagcgccegc tccttecgct ttcetccctt cctttcccgc 1421 

cacgctcgcc ggctttcccc gteaagctce aaateggggg ececctttag ggetccgact 1481

cagtgettta cggcacceeg accccaaaaa acttgactag ggcgatggtt cacgtagtgg 1541 

gccatcgccc tgatagaegg tetttcgecc tttgacgttg gageccacgt cctteaatag 1601 

tggactctcg ctccaaactg gaacaacace caaccceacc ceggtctatt cttctgattt 1661 

acaagggatt ttgccgattt eggectattr gctaaaaaat gagctgattt aacaaaaatc 1721 

caacgegaat tctaacaaaa tactaacget tacaaettca ggtggcactt eteggggaaa 1781

cgtgcgcgga acccctattt gtttattttt ceaaaeacat tcaaatatgt atccgctcat 1841

gagacaataa ccctgataaa tgctecaata atattgaaaa aggaagacta tgagtattca 1901 

acatttccgt gccgccccta ttccctcttt tgcggcattt egccttcccg ettttgecea 1961 




cccagaaacg ctggtgaaag taaaagaigc cgaagatcag ncgggtgcac gagcgggu-a 2021 

cazzqaaczq gacctcaaca gcggcaacac zzzzqaqaqz ctcccccccg aagaacgc:: 2081

cccaatgatg agcactrcca aagttctgcc acgrggcgcg gtaccatccc gcattgacgc 2141 

cgggcaagag caactcggtc gccgcataca ctattctcaq aangactitgg ttgagtactc 2201 

accagtcaca gaaaagcatc ttacggatgg cacgacagta agagaactat gcagtgctgc 2261 

cataaccatg. agtgataaca ctgcggccaa -tcacc'ccg acaacgatcg gaggaccgaa 2321 

ggagctaacc qczzzzzzqz acaacatggg ggatcacg-a aczcgcct-g a^cgrrggga 2381 

accggagctg aatgaagcca Zaccaaacqa cqagcqzqac accacgazqc crgcagcaat 24 41 

ggcaacaacg ttgcgcaaac -actaac-gg zqaaczaczz acrctagcr" cccggcaaca 2501 

atrgatagac ^ggatggagg cggataaagt cgcaggacca czzczqcgcz cggcccttcc 2561 

ggcrggcrgg zztazzqczq azaaazczqq aqocqqzqaq cqzqqcZczz qcqqzazcaZ 2621 

rgcagcac-g gggccagatg gtaagccc:: ccg-atcgza gt-atc-aca cgacggggag 2631 

ccaggcaact atggatgaac gaaaiagaca qazcqzzqaq atagg-gcct. cactga^raa 2741 

qcazzqqzaq gaazzaazqa zqzzzcqzzz agacaaaagt aaaqzqazza acagcgcat: 2801

agagcrgctt aatgaggtcg gaatcgaagg :::aacaacc cgtaaacrcg cccagaagct 2861

aggzg-cagag cagccracac tgnatrggca :g-aaaaaa: aagcgggcct: rgczcgacgc 2921

czzaqccazz gagatgtcag ataggcacca zaczcaczzz zqcczzzzac aaggggaaag 2981 

c-ggcaacat tttn-acg^a ataacgc^aa aag" "aga Zqzqczzzac -aagtcaccg 3041 

cganggagca aaag-acat: taggtacacg gcccacagaa aaacagtacg aaaczzzzqa 3101 

aaaccaacca gcc:::::ac gccaacaacg ,::::i::acc2 gagaacgcac zazazqcacz 3161 

cagcgcagtg gggca'ttra cnt-agg-ig zqzazzqqaa gazcaagagc accaag-cgc 3221 

raaagaacaa acggaaacac c:ac:ac:ca zaqzazqzcq ccaz-ar-ac gacaagctat 3281 

cgaatmatct gazcaccaag grgcagagcc aqczzzczza. "cggcctcg aantgaccat 3341

a"gcggacta gaaaaacaac tcaaatgTiga aagcgggrct- taaaagcagc ataaccttrc 3401 

zccgrgatgg -aac'tcac: agftnaaaag gatctagg'g aagatccr;: ;tga:aatcc 3461 

catgaccaaa azcczz :aac g'gagtc.ic qzzzcazzqa gcgtcagacc ccg'agaaaa 3521 

gaccaaagga -cr-ct-gag atcctc:::: nr-gcgcgta atzzqozccz -gcaaacaaa 3581 

aaaaccaccg c:accagcgg zqqzzzqz z z gccgga~caa gagctaccaa zzczzzzzcz 3641

gaaggtaact qczzzzaqca gagcgcaga* accaaatac: grcc^tcrag tgtagccgca 3701 

qzzaqqzzac cacttcaaga ac;c:g:acc azzzczzaca zazzzzqczz tgccaaccct 3761 

gttaccagtg qzzcctqcza grggcgacaa qzzqzqzczz accgggttgg actcaagacg 3821
atagttaccg gataaggcgc agcggtcggg ctgaacgggg ggttcgtgca cacagcccag 3881

cttggagcga acgacctaca ccgaactgag atacctacag cgtgagctat gagaaagcgc 3941

cacgcttccc gaagggagaa aggcggacag gtatccggta agcggcaggg tcggaacagg 4001

agagcgcacg agggagcttc cagggggaaa cgcctggtat ctttatagtc ctgtcgggtt 4061

tcgccacctc tgacttgagc gtcgattttt gtgatgctcg tcaggggggc ggagcctatg 4121

gaaaaacgcc agcaacgcgg cctttttacg gttcctggcc ttttgctggc cttttgctca 4181

catgacccga ca 4193



<210> 4 
<211> 340 

<212> PRT
<213> Bacillus megaterium + Haementeria ghilianii fusion protein

<400> 4

Met Lys Leu Leu Pro Cys Lys Glu Trp His Gln Gly Ile Pro Asn Pro
1 5 10 15

Arg Cys Trp Cys Gly Ala Asp Leu Glu Cys Ala Gln Asp Gln Tyr Cys
20 25 30

Ala Phe Ile Pro Gln Cys Arg Pro Arg Ser Glu Leu Ile Lys Pro Met
35 40 45

Asp Asp Ile Tyr Gln Arg Pro Val Glu Phe Pro Asn Leu Pro Leu Lys
50 55 60

Pro Arg Glu Glu Met Tyr Thr Asp Leu Lys Asp Lys Val Val
65 70

Val Ile Thr Gly Gly Ser Thr Gly Leu Gly Arg Ala Met Ala Val Arg
80 85 90

Phe Gly Gln Glu Glu Ala Lys Val Val Ile Asn Tyr Tyr Asn Asn Glu
95 100 105 110

Glu Glu Ala Leu Asp Ala Lys Lys Glu Val Glu Glu Ala Gly Gly Gln
115 120 125

Ala Ile Ile Val Gln Gly Asp Val Thr Lys Glu Glu Asp Val Val Asn
130 135 140

Leu Val Gln Thr Ala Ile Lys Glu Phe Gly Thr Leu Asp Val Met Ile
145 150 155

Asn Asn Ala Gly Val Glu Asn Pro Val Pro Ser His Glu Leu Ser Leu
160 165 170

Asp Asn Trp Asn Lys Val Ile Asp Thr Asn Leu Thr Gly Ala Phe Leu
175 180 185 190



Gly Ser Arg Glu Ala Ile Lys Tyr Phe Val Glu Asn Asp Ile Lys Gly
195 200 205

Asn Val Ile Asn Met Ser Ser Val His Glu Met Ile Pro Trp Pro Leu
210 215 220

Phe Val His Tyr Ala Ala Ser Lys Gly Gly Met Lys Leu Met Thr Glu
225 230 235

Thr Leu Ala Leu Glu Tyr Ala Pro Lys Gly Ile Arg Val Asn Asn Ile
240 245 250

Gly Pro Gly Ala Met Asn Thr Pro Ile Asn Ala Glu Lys Phe Ala Asp
255 260 265 270

Pro Glu Gln Arg Ala Asp Val Glu Ser Met Ile Pro Met Gly Tyr Ile
275 280 285

Gly Lys Pro Glu Glu Val Ala Ala Val Ala Ala Phe Leu Ala Ser Ser
290 295 300

Gln Ala Ser Tyr Val Thr Gly Ile Thr Leu Phe Ala Asp Gly Gly Met
305 310 315

Thr Lys Tyr Pro Ser Phe Gln Ala Gly Arg Gly Ala Met Arg Gly Ser
320 325 330

His His His His His His 
335 340 



<210> 5 
<211> 32
<212> DNA 

<213> Artificial sequence 
<220> 

<221> primer_bind
<222> (1) . . (32)
<223> Primer 1, GlcDH

<220> 

<223> Description of the artificial sequence: primer 
<400> 5 

gcgcgaactc atgtatacag attcaaaaaq ac 

<210> 6 
<211> 31 
<212> DNA

<213> Artificial sequence 
<220> 

<221> primer_bind

<222> (1) . . (31)

<223> Primer 2, GlcDH

<220> 

<223> Description of the artificial sequence: primer 
<400> 6

gcgcttcgaa ctattagcct cttcctgctt g



<210> 7
<211> 31
<212> DNA

<213> Artificial sequence 

<223> Description of the artificial sequence: primer
<400> 7

gcgcctgcag atgtatacag atttaaaaga t



<210> 8
<211> 31
<212> DNA

<213> Artificial sequence 
<220> 

<221> primer_bind

<222> (1) . . (31)

<223> Primer 4, GlcDH

<220>

<223> Description of the artificial sequence: primer 
<400> 8 

gcgcagcgct ctattagcct cttcctgctt g



<210> 9 
<211> 31 
<212> DNA 

<213> Artificial sequence 
<220> 

<221> primer_bind

<222> (1) . . (31)

<223> Primer 5, Tridegin

<220> 

<223> Description of the artificial sequence: primer 
<400> 9 

gcgcatcgat atgaaactat tgccttgcaa a



<210> 10 

<211> 31 

<212> DNA 

<213> Artificial sequence 



<220> 

<221> primer_bind

<222> (1) . . (31)

<223> Primer 6, Tridegin 



<223> Description of the artificial sequence: primer 
<400> 10

gcgcctgcag gtgatggtga tggtgatgcg a



<210> 11 
<211> 22 
<212> DNA 

<213> Artificial sequence 
<220> 

<221> primer_bind

<222> (1) . . (22)

<223> Primer 7, pASK 75 UPN






<220> 

<223> Description of the artificial sequence: primer 

<400> 11
ccatcgaatg gccagatgat ta 22



<210> 12 
<211> 21

<212> DNA
<213> Artificial sequence 

<220> 

<221> primer_bind
<222> (1) . . (21)
<223> pASK 75 RPN

<220> 

<223> Description of the artificial sequence: primer 

<400> 12
tagcggtaaa cggcagacaa a 21

<210> 13 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220>

<221> primer_bind

<222> (1) . . (20)

<223> Primer 9, T7 Seq. 

<223> Description of the artificial sequence: primer 

<400> 13
taatacgact cactataggg 20



<210> 14 

<211> 18

<212> DNA 

<213> Artificial sequence 



<220> 



<221> primer_bind
<222> (1) . . (18)
<223> Rev. Seq.

<220> 

<223> Description of the artificial sequence: primer 



<400> 14 

tagaaggcac agtcgagg 18



